Discursive Sentence Compression

Authors

  • Alejandro Molina
  • Juan-Manuel Torres-Moreno
  • Eric SanJuan
  • Iria da Cunha
  • Gerardo Sierra
Abstract

This paper presents a method for automatic summarization by deleting intra-sentence discourse segments. Each sentence is first divided into elementary discourse units, and the less informative segments are then deleted. To analyze the results, we set up an annotation campaign, which revealed interesting aspects of discourse-segment elimination as an alternative to the sentence compression task. The results show that disagreement about the optimal compressed sentence is high and increases with the complexity of the sentence. However, there is some agreement on the decision to delete discourse segments. The informativeness of each segment is calculated using textual energy, a method that has shown good results in automatic summarization.
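The pipeline described in the abstract (segment, score, delete) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the sentence has already been split into discourse segments, and it assumes the textual energy formulation E = -1/2 · S (SᵀS) Sᵀ, where S is a segment-by-term frequency matrix and a segment's score is the sum of absolute values in its row of E. The function names, the tokenizer, and the `keep_ratio` parameter are hypothetical choices made for this example; the abstract specifies none of them.

```python
import re


def tokenize(segment):
    """Lowercase word tokens (assumption: no stop-word removal or stemming)."""
    return re.findall(r"[a-z]+", segment.lower())


def textual_energy(segments):
    """Return one score per segment; a larger score means more lexical
    interaction with the other segments.

    Assumed formulation: E = -1/2 * S (S^T S) S^T, with S the
    segment-by-term frequency matrix. Score i = sum_j |E[i][j]|.
    """
    vocab = sorted({t for seg in segments for t in tokenize(seg)})
    index = {t: k for k, t in enumerate(vocab)}

    # Build S: rows are segments, columns are terms.
    S = [[0] * len(vocab) for _ in segments]
    for i, seg in enumerate(segments):
        for t in tokenize(seg):
            S[i][index[t]] += 1

    n = len(segments)
    # G = S S^T counts the terms shared by each pair of segments.
    G = [[sum(a * b for a, b in zip(S[i], S[j])) for j in range(n)]
         for i in range(n)]
    # S (S^T S) S^T equals G G, so E = -1/2 * G G.
    E = [[-0.5 * sum(G[i][k] * G[k][j] for k in range(n)) for j in range(n)]
         for i in range(n)]
    return [sum(abs(x) for x in row) for row in E]


def compress(segments, keep_ratio=0.5):
    """Keep the highest-scoring segments, preserving their original order."""
    scores = textual_energy(segments)
    n_keep = max(1, round(len(segments) * keep_ratio))
    ranked = sorted(range(len(segments)), key=lambda i: scores[i], reverse=True)
    kept = sorted(ranked[:n_keep])
    return [segments[i] for i in kept]


if __name__ == "__main__":
    example = [
        "the summarizer deletes segments",
        "as everybody knows",
        "segments with low energy",
    ]
    print(compress(example, keep_ratio=0.67))
```

In this sketch a segment that shares no vocabulary with the rest of the sentence receives a low score and is deleted first. A realistic system would need a discourse segmenter and would probably compute the term matrix over the whole document rather than a single sentence.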


Similar resources

Discursive Mining Viewpoints in Building Multi-Document Synthesized Sheets

Multi-documents sheets are viewed as semantically structured representations of textual documents. The automatic construction of these sheets is based on the automatic annotation of textual documents according to a set of discursive categories called discursive mining viewpoints. The automatic annotation of a text is performed using the Contextual Exploration processing. It is a linguistic and ...


Document-level translation quality estimation: exploring discourse and pseudo-references

Predicting the quality of machine translations is a challenging topic. Quality estimation (QE) of translations is based on features of the source and target texts (without the need for human references), and on supervised machine learning methods to build prediction models. Engineering well-performing features is therefore crucial in QE modelling. Several features have been used so far, but the...


Incompatibility, Modal Semantics and Intrinsic Logic

I closed my lecture last week with an argument building on the idea that every autonomous discursive practice, in order to count as a discursive or linguistic practice, in order to count as deploying any vocabulary, must include performances that have the pragmatic significance of assertions, which on the syntactic side are utterances of declarative sentences, and whose semantic content consist...


A Generative Approach for Multi-Document Summarization using Semantic-Discursive information

Multi-document summarization is the automatic production of a unique summary from a collection of texts. In this paper, we propose a statistical generative approach for multi-document summarization that combines simple information such as sentence position in the text and semantic-discursive information from CST (Cross-Document Structure Theory). In particular, we formulate the multi-document s...


Certamente and Sicuramente. Encoding Dynamic and Discursive Aspects of Commitment in Italian

Commitment should be understood as a dynamic and discursive category. This raises some important questions for the theory of grammar: to what extent do languages encode the dynamic and discursive aspects of commitment? At what level of analysis does this encoding take place? Which markers encode these aspects? In order to answer some of these questions two Italian adverbs expressing strong comm...



Publication date: 2013